Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 4915 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 691.3 KiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 4 |
annualHomeownersInsurance is highly overall correlated with bathrooms and 8 other fields | High correlation |
bathrooms is highly overall correlated with annualHomeownersInsurance and 4 other fields | High correlation |
bedrooms is highly overall correlated with annualHomeownersInsurance and 6 other fields | High correlation |
cluster is highly overall correlated with annualHomeownersInsurance and 11 other fields | High correlation |
cluster_average_price is highly overall correlated with annualHomeownersInsurance and 11 other fields | High correlation |
countyFIPS is highly overall correlated with cluster and 1 other fields | High correlation |
homeType_CONDO is highly overall correlated with annualHomeownersInsurance and 8 other fields | High correlation |
homeType_SINGLE_FAMILY is highly overall correlated with annualHomeownersInsurance and 5 other fields | High correlation |
latitude is highly overall correlated with cluster and 1 other fields | High correlation |
livingArea is highly overall correlated with annualHomeownersInsurance and 7 other fields | High correlation |
longitude is highly overall correlated with cluster and 1 other fields | High correlation |
monthlyHoaFee is highly overall correlated with homeType_CONDO | High correlation |
price is highly overall correlated with annualHomeownersInsurance and 8 other fields | High correlation |
propertyTaxRate is highly overall correlated with cluster and 1 other fields | High correlation |
rentZestimate is highly overall correlated with annualHomeownersInsurance and 7 other fields | High correlation |
zipcode is highly overall correlated with cluster and 1 other fields | High correlation |
monthlyHoaFee has 3943 (80.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-17 00:39:16.993197 |
|---|---|
| Analysis finished | 2024-12-17 00:39:41.888805 |
| Duration | 24.9 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
longitude
Real number (ℝ)
High correlation 
| Distinct | 3673 |
|---|---|
| Distinct (%) | 74.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -149.2102 |
| Minimum | -150.01093 |
|---|---|
| Maximum | -70.4831 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4915 |
| Negative (%) | 100.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | -150.01093 |
|---|---|
| 5-th percentile | -149.95921 |
| Q1 | -149.92895 |
| median | -149.87314 |
| Q3 | -149.81379 |
| 95-th percentile | -149.72975 |
| Maximum | -70.4831 |
| Range | 79.52783 |
| Interquartile range (IQR) | 0.115155 |
Descriptive statistics
| Standard deviation | 6.5336835 |
|---|---|
| Coefficient of variation (CV) | -0.043788452 |
| Kurtosis | 102.47482 |
| Mean | -149.2102 |
| Median Absolute Deviation (MAD) | 0.05764 |
| Skewness | 10.114499 |
| Sum | -733368.11 |
| Variance | 42.68902 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -149.73212 | 13 | 0.3% |
| -149.94147 | 13 | 0.3% |
| -149.92009 | 12 | 0.2% |
| -149.89989 | 12 | 0.2% |
| -149.72975 | 11 | 0.2% |
| -149.82962 | 10 | 0.2% |
| -149.89291 | 8 | 0.2% |
| -149.88034 | 8 | 0.2% |
| -149.89194 | 7 | 0.1% |
| -149.9455 | 7 | 0.1% |
| Other values (3663) | 4814 |
| Value | Count | Frequency (%) |
| -150.01093 | 1 | |
| -150.0097 | 1 | |
| -150.00902 | 1 | |
| -150.00879 | 1 | |
| -150.00493 | 1 | |
| -150.0028 | 1 | |
| -150.00247 | 1 | |
| -150.00201 | 1 | |
| -150.00186 | 1 | |
| -150.00182 | 1 |
| Value | Count | Frequency (%) |
| -70.4831 | 1 | |
| -70.483406 | 2 | |
| -71.42977 | 1 | |
| -72.25583 | 1 | |
| -73.171455 | 1 | |
| -73.702896 | 1 | |
| -73.819725 | 1 | |
| -73.893456 | 1 | |
| -75.349976 | 1 | |
| -75.40257 | 1 |
countyFIPS
Real number (ℝ)
High correlation 
| Distinct | 36 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2249.4529 |
| Minimum | 2020 |
|---|---|
| Maximum | 55079 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 2020 |
|---|---|
| 5-th percentile | 2020 |
| Q1 | 2020 |
| median | 2020 |
| Q3 | 2020 |
| 95-th percentile | 2020 |
| Maximum | 55079 |
| Range | 53059 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2665.8739 |
|---|---|
| Coefficient of variation (CV) | 1.185121 |
| Kurtosis | 200.81946 |
| Mean | 2249.4529 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.610229 |
| Sum | 11056061 |
| Variance | 7106883.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2020 | 4864 | |
| 12015 | 4 | 0.1% |
| 26125 | 3 | 0.1% |
| 6065 | 3 | 0.1% |
| 34005 | 2 | < 0.1% |
| 29095 | 2 | < 0.1% |
| 10005 | 2 | < 0.1% |
| 37071 | 2 | < 0.1% |
| 6071 | 2 | < 0.1% |
| 55079 | 2 | < 0.1% |
| Other values (26) | 29 | 0.6% |
| Value | Count | Frequency (%) |
| 2020 | 4864 | |
| 6059 | 1 | < 0.1% |
| 6065 | 3 | 0.1% |
| 6071 | 2 | < 0.1% |
| 9001 | 2 | < 0.1% |
| 10005 | 2 | < 0.1% |
| 12011 | 1 | < 0.1% |
| 12015 | 4 | 0.1% |
| 12021 | 1 | < 0.1% |
| 12071 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 55079 | 2 | |
| 48491 | 1 | |
| 48245 | 1 | |
| 42095 | 1 | |
| 42077 | 1 | |
| 42049 | 1 | |
| 40037 | 1 | |
| 39155 | 1 | |
| 37071 | 2 | |
| 37023 | 1 |
monthlyHoaFee
Real number (ℝ)
High correlation  Zeros 
| Distinct | 203 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.618922 |
| Minimum | 0 |
|---|---|
| Maximum | 873 |
| Zeros | 3943 |
| Zeros (%) | 80.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 346 |
| Maximum | 873 |
| Range | 873 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 118.04396 |
|---|---|
| Coefficient of variation (CV) | 2.5321041 |
| Kurtosis | 7.4911907 |
| Mean | 46.618922 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.7362757 |
| Sum | 229132 |
| Variance | 13934.377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3943 | |
| 18 | 29 | 0.6% |
| 25 | 28 | 0.6% |
| 90 | 24 | 0.5% |
| 300 | 22 | 0.4% |
| 400 | 21 | 0.4% |
| 115 | 20 | 0.4% |
| 265 | 19 | 0.4% |
| 272 | 17 | 0.3% |
| 338 | 16 | 0.3% |
| Other values (193) | 776 | 15.8% |
| Value | Count | Frequency (%) |
| 0 | 3943 | |
| 4 | 13 | 0.3% |
| 6 | 5 | 0.1% |
| 8 | 5 | 0.1% |
| 10 | 2 | < 0.1% |
| 11 | 6 | 0.1% |
| 12 | 14 | 0.3% |
| 13 | 12 | 0.2% |
| 15 | 11 | 0.2% |
| 16 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 873 | 2 | < 0.1% |
| 830 | 1 | < 0.1% |
| 752 | 4 | |
| 691 | 1 | < 0.1% |
| 681 | 6 | |
| 664 | 1 | < 0.1% |
| 650 | 1 | < 0.1% |
| 634 | 9 | |
| 565 | 2 | < 0.1% |
| 563 | 1 | < 0.1% |
annualHomeownersInsurance
Real number (ℝ)
High correlation 
| Distinct | 2064 |
|---|---|
| Distinct (%) | 42.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1680.0374 |
| Minimum | 252 |
|---|---|
| Maximum | 8646 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 252 |
|---|---|
| 5-th percentile | 661 |
| Q1 | 1252 |
| median | 1631 |
| Q3 | 1972 |
| 95-th percentile | 2877.6 |
| Maximum | 8646 |
| Range | 8394 |
| Interquartile range (IQR) | 720 |
Descriptive statistics
| Standard deviation | 722.06046 |
|---|---|
| Coefficient of variation (CV) | 0.42978832 |
| Kurtosis | 7.8647068 |
| Mean | 1680.0374 |
| Median Absolute Deviation (MAD) | 360 |
| Skewness | 1.6992913 |
| Sum | 8257384 |
| Variance | 521371.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1630 | 14 | 0.3% |
| 1633 | 12 | 0.2% |
| 1743 | 11 | 0.2% |
| 1341 | 11 | 0.2% |
| 1691 | 11 | 0.2% |
| 1733 | 11 | 0.2% |
| 1696 | 10 | 0.2% |
| 1677 | 10 | 0.2% |
| 1762 | 9 | 0.2% |
| 1310 | 9 | 0.2% |
| Other values (2054) | 4807 |
| Value | Count | Frequency (%) |
| 252 | 1 | |
| 335 | 1 | |
| 366 | 1 | |
| 378 | 2 | |
| 382 | 1 | |
| 384 | 1 | |
| 388 | 1 | |
| 394 | 1 | |
| 397 | 1 | |
| 402 | 1 |
| Value | Count | Frequency (%) |
| 8646 | 1 | |
| 8004 | 1 | |
| 7710 | 1 | |
| 6539 | 1 | |
| 6274 | 1 | |
| 6273 | 1 | |
| 6143 | 1 | |
| 6071 | 1 | |
| 5982 | 1 | |
| 5830 | 1 |
yearBuilt
Real number (ℝ)
| Distinct | 89 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1976.0761 |
| Minimum | 1900 |
|---|---|
| Maximum | 2022 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1952 |
| Q1 | 1969 |
| median | 1977 |
| Q3 | 1983 |
| 95-th percentile | 1997 |
| Maximum | 2022 |
| Range | 122 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 12.988204 |
|---|---|
| Coefficient of variation (CV) | 0.0065727245 |
| Kurtosis | 1.1505688 |
| Mean | 1976.0761 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.12820139 |
| Sum | 9712414 |
| Variance | 168.69344 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1983 | 464 | 9.4% |
| 1982 | 339 | 6.9% |
| 1978 | 235 | 4.8% |
| 1975 | 208 | 4.2% |
| 1984 | 197 | 4.0% |
| 1974 | 190 | 3.9% |
| 1977 | 182 | 3.7% |
| 1972 | 181 | 3.7% |
| 1981 | 167 | 3.4% |
| 1976 | 162 | 3.3% |
| Other values (79) | 2590 |
| Value | Count | Frequency (%) |
| 1900 | 2 | < 0.1% |
| 1911 | 1 | < 0.1% |
| 1915 | 1 | < 0.1% |
| 1930 | 1 | < 0.1% |
| 1935 | 1 | < 0.1% |
| 1938 | 2 | < 0.1% |
| 1939 | 3 | |
| 1940 | 5 | |
| 1941 | 7 | |
| 1942 | 4 |
| Value | Count | Frequency (%) |
| 2022 | 1 | < 0.1% |
| 2021 | 2 | < 0.1% |
| 2020 | 13 | |
| 2019 | 6 | |
| 2018 | 6 | |
| 2017 | 2 | < 0.1% |
| 2016 | 2 | < 0.1% |
| 2015 | 3 | 0.1% |
| 2013 | 2 | < 0.1% |
| 2012 | 2 | < 0.1% |
latitude
Real number (ℝ)
High correlation 
| Distinct | 3992 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.915614 |
| Minimum | 26.004696 |
|---|---|
| Maximum | 61.231228 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 26.004696 |
|---|---|
| 5-th percentile | 61.123306 |
| Q1 | 61.14649 |
| median | 61.172867 |
| Q3 | 61.197457 |
| 95-th percentile | 61.219247 |
| Maximum | 61.231228 |
| Range | 35.226532 |
| Interquartile range (IQR) | 0.0509665 |
Descriptive statistics
| Standard deviation | 2.5643447 |
|---|---|
| Coefficient of variation (CV) | 0.042096672 |
| Kurtosis | 110.07436 |
| Mean | 60.915614 |
| Median Absolute Deviation (MAD) | 0.025723 |
| Skewness | -10.362467 |
| Sum | 299400.24 |
| Variance | 6.5758635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 61.18444 | 13 | 0.3% |
| 61.21068 | 12 | 0.2% |
| 61.180492 | 12 | 0.2% |
| 61.222507 | 11 | 0.2% |
| 61.1795 | 10 | 0.2% |
| 61.202625 | 10 | 0.2% |
| 61.20454 | 10 | 0.2% |
| 61.182777 | 9 | 0.2% |
| 61.20415 | 9 | 0.2% |
| 61.219677 | 7 | 0.1% |
| Other values (3982) | 4812 |
| Value | Count | Frequency (%) |
| 26.004696 | 1 | |
| 26.10778 | 1 | |
| 26.527695 | 1 | |
| 26.899212 | 1 | |
| 27.510103 | 1 | |
| 28.249554 | 1 | |
| 28.95348 | 1 | |
| 29.595736 | 1 | |
| 29.986279 | 1 | |
| 30.6175 | 1 |
| Value | Count | Frequency (%) |
| 61.231228 | 1 | |
| 61.2311 | 1 | |
| 61.231094 | 1 | |
| 61.23106 | 1 | |
| 61.23081 | 1 | |
| 61.2308 | 1 | |
| 61.23066 | 1 | |
| 61.23053 | 1 | |
| 61.2305 | 1 | |
| 61.23045 | 1 |
rentZestimate
Real number (ℝ)
High correlation 
| Distinct | 2165 |
|---|---|
| Distinct (%) | 44.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2647.7137 |
| Minimum | 782 |
|---|---|
| Maximum | 7545 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 782 |
|---|---|
| 5-th percentile | 1595.4 |
| Q1 | 2181 |
| median | 2609 |
| Q3 | 3015 |
| 95-th percentile | 3928.5 |
| Maximum | 7545 |
| Range | 6763 |
| Interquartile range (IQR) | 834 |
Descriptive statistics
| Standard deviation | 721.58966 |
|---|---|
| Coefficient of variation (CV) | 0.27253311 |
| Kurtosis | 1.893501 |
| Mean | 2647.7137 |
| Median Absolute Deviation (MAD) | 418 |
| Skewness | 0.83486308 |
| Sum | 13013513 |
| Variance | 520691.63 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2804 | 12 | 0.2% |
| 1646 | 11 | 0.2% |
| 1645 | 11 | 0.2% |
| 2594 | 10 | 0.2% |
| 1744 | 10 | 0.2% |
| 1601 | 10 | 0.2% |
| 1565 | 9 | 0.2% |
| 1909 | 9 | 0.2% |
| 2369 | 9 | 0.2% |
| 1824 | 9 | 0.2% |
| Other values (2155) | 4815 |
| Value | Count | Frequency (%) |
| 782 | 1 | |
| 846 | 1 | |
| 903 | 1 | |
| 1046 | 1 | |
| 1083 | 1 | |
| 1089 | 1 | |
| 1110 | 1 | |
| 1125 | 1 | |
| 1174 | 1 | |
| 1175 | 1 |
| Value | Count | Frequency (%) |
| 7545 | 1 | |
| 6515 | 1 | |
| 6253 | 1 | |
| 6072 | 1 | |
| 6023 | 1 | |
| 5999 | 1 | |
| 5785 | 1 | |
| 5783 | 1 | |
| 5734 | 1 | |
| 5658 | 1 |
timeOnZillow
Real number (ℝ)
| Distinct | 2473 |
|---|---|
| Distinct (%) | 50.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3788.3434 |
| Minimum | 1 |
|---|---|
| Maximum | 19949 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1336 |
| Q1 | 3179 |
| median | 3781 |
| Q3 | 4398.5 |
| 95-th percentile | 6308 |
| Maximum | 19949 |
| Range | 19948 |
| Interquartile range (IQR) | 1219.5 |
Descriptive statistics
| Standard deviation | 1563.7836 |
|---|---|
| Coefficient of variation (CV) | 0.41278822 |
| Kurtosis | 8.6106925 |
| Mean | 3788.3434 |
| Median Absolute Deviation (MAD) | 614 |
| Skewness | 1.2218956 |
| Sum | 18619708 |
| Variance | 2445419 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3917 | 15 | 0.3% |
| 3758 | 14 | 0.3% |
| 3581 | 14 | 0.3% |
| 3644 | 13 | 0.3% |
| 3728 | 12 | 0.2% |
| 3666 | 11 | 0.2% |
| 4085 | 11 | 0.2% |
| 3532 | 11 | 0.2% |
| 4031 | 10 | 0.2% |
| 3882 | 10 | 0.2% |
| Other values (2463) | 4794 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 3 | |
| 8 | 1 | < 0.1% |
| 9 | 2 | |
| 10 | 1 | < 0.1% |
| 11 | 3 | |
| 12 | 3 | |
| 13 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 19949 | 3 | |
| 11640 | 1 | < 0.1% |
| 11152 | 1 | < 0.1% |
| 11065 | 1 | < 0.1% |
| 10994 | 1 | < 0.1% |
| 10777 | 1 | < 0.1% |
| 10766 | 1 | < 0.1% |
| 10759 | 1 | < 0.1% |
| 10701 | 1 | < 0.1% |
| 10575 | 1 | < 0.1% |
livingArea
Real number (ℝ)
High correlation 
| Distinct | 1851 |
|---|---|
| Distinct (%) | 37.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1810.1776 |
| Minimum | 20 |
|---|---|
| Maximum | 8349 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 746 |
| Q1 | 1178.5 |
| median | 1720 |
| Q3 | 2150.5 |
| 95-th percentile | 3505.1 |
| Maximum | 8349 |
| Range | 8329 |
| Interquartile range (IQR) | 972 |
Descriptive statistics
| Standard deviation | 872.95082 |
|---|---|
| Coefficient of variation (CV) | 0.48224595 |
| Kurtosis | 4.7383456 |
| Mean | 1810.1776 |
| Median Absolute Deviation (MAD) | 498 |
| Skewness | 1.6027467 |
| Sum | 8897023 |
| Variance | 762043.14 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1040 | 67 | 1.4% |
| 1920 | 51 | 1.0% |
| 1824 | 47 | 1.0% |
| 1872 | 43 | 0.9% |
| 1728 | 39 | 0.8% |
| 1976 | 32 | 0.7% |
| 1200 | 31 | 0.6% |
| 988 | 27 | 0.5% |
| 1152 | 26 | 0.5% |
| 1344 | 23 | 0.5% |
| Other values (1841) | 4529 |
| Value | Count | Frequency (%) |
| 20 | 1 | |
| 320 | 1 | |
| 399 | 2 | |
| 400 | 1 | |
| 404 | 1 | |
| 415 | 1 | |
| 446 | 1 | |
| 450 | 1 | |
| 460 | 1 | |
| 470 | 1 |
| Value | Count | Frequency (%) |
| 8349 | 1 | |
| 7500 | 1 | |
| 7227 | 1 | |
| 7010 | 2 | |
| 7004 | 1 | |
| 6984 | 1 | |
| 6878 | 1 | |
| 6692 | 1 | |
| 6477 | 1 | |
| 6451 | 1 |
zipcode
Real number (ℝ)
High correlation 
| Distinct | 58 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98930.938 |
| Minimum | 2649 |
|---|---|
| Maximum | 99518 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 2649 |
|---|---|
| 5-th percentile | 99501 |
| Q1 | 99502 |
| median | 99507 |
| Q3 | 99515 |
| 95-th percentile | 99518 |
| Maximum | 99518 |
| Range | 96869 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 6281.3839 |
|---|---|
| Coefficient of variation (CV) | 0.063492615 |
| Kurtosis | 148.56915 |
| Mean | 98930.938 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -11.917762 |
| Sum | 4.8624556 × 108 |
| Variance | 39455784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99507 | 900 | |
| 99502 | 833 | |
| 99508 | 736 | |
| 99517 | 606 | |
| 99504 | 405 | |
| 99518 | 393 | |
| 99501 | 347 | 7.1% |
| 99515 | 248 | 5.0% |
| 99503 | 240 | 4.9% |
| 99516 | 156 | 3.2% |
| Other values (48) | 51 | 1.0% |
| Value | Count | Frequency (%) |
| 2649 | 3 | |
| 2865 | 1 | < 0.1% |
| 6415 | 1 | < 0.1% |
| 6607 | 1 | < 0.1% |
| 11355 | 1 | < 0.1% |
| 12118 | 1 | < 0.1% |
| 12524 | 1 | < 0.1% |
| 16509 | 1 | < 0.1% |
| 18015 | 1 | < 0.1% |
| 18018 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99518 | 393 | |
| 99517 | 606 | |
| 99516 | 156 | 3.2% |
| 99515 | 248 | 5.0% |
| 99508 | 736 | |
| 99507 | 900 | |
| 99504 | 405 | |
| 99503 | 240 | 4.9% |
| 99502 | 833 | |
| 99501 | 347 | 7.1% |
propertyTaxRate
Real number (ℝ)
High correlation 
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.308059 |
| Minimum | 0.57 |
|---|---|
| Maximum | 1.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 0.57 |
|---|---|
| 5-th percentile | 1.31 |
| Q1 | 1.31 |
| median | 1.31 |
| Q3 | 1.31 |
| 95-th percentile | 1.31 |
| Maximum | 1.89 |
| Range | 1.32 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.044278421 |
|---|---|
| Coefficient of variation (CV) | 0.033850477 |
| Kurtosis | 180.49494 |
| Mean | 1.308059 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -9.4320429 |
| Sum | 6429.11 |
| Variance | 0.0019605785 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.31 | 4864 | |
| 0.63 | 3 | 0.1% |
| 1.36 | 2 | < 0.1% |
| 1.57 | 2 | < 0.1% |
| 1.14 | 2 | < 0.1% |
| 0.66 | 2 | < 0.1% |
| 0.59 | 2 | < 0.1% |
| 1.66 | 1 | < 0.1% |
| 0.91 | 1 | < 0.1% |
| 1.24 | 1 | < 0.1% |
| Other values (35) | 35 | 0.7% |
| Value | Count | Frequency (%) |
| 0.57 | 1 | < 0.1% |
| 0.59 | 2 | |
| 0.61 | 1 | < 0.1% |
| 0.62 | 1 | < 0.1% |
| 0.63 | 3 | |
| 0.66 | 2 | |
| 0.67 | 1 | < 0.1% |
| 0.71 | 1 | < 0.1% |
| 0.72 | 1 | < 0.1% |
| 0.73 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.89 | 1 | |
| 1.87 | 1 | |
| 1.85 | 1 | |
| 1.72 | 1 | |
| 1.69 | 1 | |
| 1.66 | 1 | |
| 1.57 | 2 | |
| 1.56 | 1 | |
| 1.52 | 1 | |
| 1.51 | 1 |
bathrooms
Real number (ℝ)
High correlation 
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0909664 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 22 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.5 |
| median | 2 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.85312086 |
|---|---|
| Coefficient of variation (CV) | 0.40800314 |
| Kurtosis | 4.1007024 |
| Mean | 2.0909664 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 1.1306375 |
| Sum | 10277.1 |
| Variance | 0.7278152 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1953 | |
| 1 | 925 | |
| 3 | 669 | 13.6% |
| 2.5 | 580 | 11.8% |
| 1.5 | 412 | 8.4% |
| 4 | 132 | 2.7% |
| 3.5 | 95 | 1.9% |
| 5 | 32 | 0.7% |
| 4.5 | 24 | 0.5% |
| 0 | 22 | 0.4% |
| Other values (13) | 71 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 22 | 0.4% |
| 0.5 | 10 | 0.2% |
| 1 | 925 | |
| 1.3 | 1 | < 0.1% |
| 1.5 | 412 | 8.4% |
| 1.75 | 15 | 0.3% |
| 1.8 | 1 | < 0.1% |
| 2 | 1953 | |
| 2.25 | 3 | 0.1% |
| 2.5 | 580 | 11.8% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 3 | 0.1% |
| 6.5 | 1 | < 0.1% |
| 6 | 11 | 0.2% |
| 5.5 | 8 | 0.2% |
| 5 | 32 | 0.7% |
| 4.5 | 24 | 0.5% |
| 4 | 132 | |
| 3.75 | 2 | < 0.1% |
bedrooms
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.202645 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1627936 |
|---|---|
| Coefficient of variation (CV) | 0.36307291 |
| Kurtosis | 5.2726558 |
| Mean | 3.202645 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.243081 |
| Sum | 15741 |
| Variance | 1.3520891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2086 | |
| 4 | 1202 | |
| 2 | 985 | |
| 5 | 245 | 5.0% |
| 1 | 196 | 4.0% |
| 6 | 138 | 2.8% |
| 7 | 25 | 0.5% |
| 8 | 14 | 0.3% |
| 10 | 11 | 0.2% |
| 0 | 7 | 0.1% |
| Other values (2) | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.1% |
| 1 | 196 | 4.0% |
| 2 | 985 | |
| 3 | 2086 | |
| 4 | 1202 | |
| 5 | 245 | 5.0% |
| 6 | 138 | 2.8% |
| 7 | 25 | 0.5% |
| 8 | 14 | 0.3% |
| 9 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 10 | 11 | 0.2% |
| 9 | 5 | 0.1% |
| 8 | 14 | 0.3% |
| 7 | 25 | 0.5% |
| 6 | 138 | 2.8% |
| 5 | 245 | 5.0% |
| 4 | 1202 | |
| 3 | 2086 | |
| 2 | 985 |
price
Real number (ℝ)
High correlation 
| Distinct | 3178 |
|---|---|
| Distinct (%) | 64.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 400004.42 |
| Minimum | 60000 |
|---|---|
| Maximum | 2058500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 38.5 KiB |
Quantile statistics
| Minimum | 60000 |
|---|---|
| 5-th percentile | 157300 |
| Q1 | 298150 |
| median | 388300 |
| Q3 | 469500 |
| 95-th percentile | 685190 |
| Maximum | 2058500 |
| Range | 1998500 |
| Interquartile range (IQR) | 171350 |
Descriptive statistics
| Standard deviation | 171919 |
|---|---|
| Coefficient of variation (CV) | 0.42979276 |
| Kurtosis | 7.8643862 |
| Mean | 400004.42 |
| Median Absolute Deviation (MAD) | 85800 |
| Skewness | 1.6992632 |
| Sum | 1.9660217 × 109 |
| Variance | 2.9556144 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 388100 | 7 | 0.1% |
| 419500 | 6 | 0.1% |
| 403900 | 6 | 0.1% |
| 375900 | 6 | 0.1% |
| 420900 | 6 | 0.1% |
| 462800 | 6 | 0.1% |
| 411600 | 6 | 0.1% |
| 394000 | 6 | 0.1% |
| 451800 | 5 | 0.1% |
| 388800 | 5 | 0.1% |
| Other values (3168) | 4856 |
| Value | Count | Frequency (%) |
| 60000 | 1 | |
| 79700 | 1 | |
| 87100 | 1 | |
| 90100 | 2 | |
| 91000 | 1 | |
| 91500 | 1 | |
| 92400 | 1 | |
| 93800 | 1 | |
| 94600 | 1 | |
| 95600 | 1 |
| Value | Count | Frequency (%) |
| 2058500 | 1 | |
| 1905800 | 1 | |
| 1835600 | 1 | |
| 1556900 | 1 | |
| 1493700 | 1 | |
| 1493600 | 1 | |
| 1462700 | 1 | |
| 1445400 | 1 | |
| 1424300 | 1 | |
| 1388100 | 1 |
homeType_CONDO
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4178 | |
| 1 | 737 | 15.0% |
homeType_SINGLE_FAMILY
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.5 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3605 | |
| 0 | 1310 | 26.7% |
cluster
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.5 KiB |
| 3 | |
|---|---|
| 0 | |
| 1 | |
| 2 | 51 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4915 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 3057 | |
| 0 | 1013 | 20.6% |
| 1 | 794 | 16.2% |
| 2 | 51 | 1.0% |
cluster_average_price
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 38.5 KiB |
| 377817.8976789801 | |
|---|---|
| 624584.1379310344 | |
| 200110.15355805244 | |
| 422504.4117647059 | 51 |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.161546 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 624584.1379310344 |
|---|---|
| 2nd row | 624584.1379310344 |
| 3rd row | 624584.1379310344 |
| 4th row | 624584.1379310344 |
| 5th row | 624584.1379310344 |
Common Values
| Value | Count | Frequency (%) |
| 377817.8976789801 | 3057 | |
| 624584.1379310344 | 1013 | 20.6% |
| 200110.15355805244 | 794 | 16.2% |
| 422504.4117647059 | 51 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 377817.8976789801 | 3057 | |
| 624584.1379310344 | 1013 | 20.6% |
| 200110.15355805244 | 794 | 16.2% |
| 422504.4117647059 | 51 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 16400 | |
| 8 | 14035 | |
| 1 | 10624 | |
| 0 | 7348 | |
| 9 | 7178 | |
| 3 | 6890 | |
| 4 | 5844 | 6.9% |
| . | 4915 | 5.8% |
| 5 | 4291 | 5.1% |
| 6 | 4121 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 84349 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 7 | 16400 | |
| 8 | 14035 | |
| 1 | 10624 | |
| 0 | 7348 | |
| 9 | 7178 | |
| 3 | 6890 | |
| 4 | 5844 | 6.9% |
| . | 4915 | 5.8% |
| 5 | 4291 | 5.1% |
| 6 | 4121 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 84349 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 7 | 16400 | |
| 8 | 14035 | |
| 1 | 10624 | |
| 0 | 7348 | |
| 9 | 7178 | |
| 3 | 6890 | |
| 4 | 5844 | 6.9% |
| . | 4915 | 5.8% |
| 5 | 4291 | 5.1% |
| 6 | 4121 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 84349 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 7 | 16400 | |
| 8 | 14035 | |
| 1 | 10624 | |
| 0 | 7348 | |
| 9 | 7178 | |
| 3 | 6890 | |
| 4 | 5844 | 6.9% |
| . | 4915 | 5.8% |
| 5 | 4291 | 5.1% |
| 6 | 4121 | 4.9% |
Interactions
Correlations
| annualHomeownersInsurance | bathrooms | bedrooms | cluster | cluster_average_price | countyFIPS | homeType_CONDO | homeType_SINGLE_FAMILY | latitude | livingArea | longitude | monthlyHoaFee | price | propertyTaxRate | rentZestimate | timeOnZillow | yearBuilt | zipcode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| annualHomeownersInsurance | 1.000 | 0.711 | 0.662 | 0.617 | 0.617 | -0.029 | 0.707 | 0.550 | -0.287 | 0.880 | -0.052 | -0.255 | 1.000 | 0.004 | 0.839 | -0.023 | 0.071 | -0.017 |
| bathrooms | 0.711 | 1.000 | 0.591 | 0.438 | 0.438 | 0.006 | 0.279 | 0.208 | -0.225 | 0.754 | 0.007 | -0.057 | 0.711 | -0.003 | 0.762 | -0.054 | 0.227 | 0.024 |
| bedrooms | 0.662 | 0.591 | 1.000 | 0.443 | 0.443 | -0.041 | 0.564 | 0.538 | -0.118 | 0.729 | 0.033 | -0.273 | 0.662 | 0.035 | 0.670 | 0.000 | -0.037 | 0.025 |
| cluster | 0.617 | 0.438 | 0.443 | 1.000 | 1.000 | 0.540 | 0.944 | 0.733 | 0.576 | 0.533 | 0.577 | 0.355 | 0.617 | 0.540 | 0.527 | 0.085 | 0.376 | 0.553 |
| cluster_average_price | 0.617 | 0.438 | 0.443 | 1.000 | 1.000 | 0.540 | 0.944 | 0.733 | 0.576 | 0.533 | 0.577 | 0.355 | 0.617 | 0.540 | 0.527 | 0.085 | 0.376 | 0.553 |
| countyFIPS | -0.029 | 0.006 | -0.041 | 0.540 | 0.540 | 1.000 | 0.000 | 0.000 | -0.176 | -0.010 | 0.176 | 0.022 | -0.029 | -0.292 | -0.048 | -0.094 | 0.065 | -0.177 |
| homeType_CONDO | 0.707 | 0.279 | 0.564 | 0.944 | 0.944 | 0.000 | 1.000 | 0.696 | 0.013 | 0.530 | 0.000 | 0.602 | 0.707 | 0.000 | 0.550 | 0.028 | 0.343 | 0.000 |
| homeType_SINGLE_FAMILY | 0.550 | 0.208 | 0.538 | 0.733 | 0.733 | 0.000 | 0.696 | 1.000 | 0.000 | 0.386 | 0.022 | 0.481 | 0.550 | 0.030 | 0.494 | 0.053 | 0.263 | 0.000 |
| latitude | -0.287 | -0.225 | -0.118 | 0.576 | 0.576 | -0.176 | 0.013 | 0.000 | 1.000 | -0.197 | 0.177 | -0.034 | -0.287 | 0.052 | -0.299 | 0.045 | -0.345 | -0.046 |
| livingArea | 0.880 | 0.754 | 0.729 | 0.533 | 0.533 | -0.010 | 0.530 | 0.386 | -0.197 | 1.000 | 0.019 | -0.210 | 0.880 | 0.014 | 0.822 | 0.008 | 0.015 | -0.019 |
| longitude | -0.052 | 0.007 | 0.033 | 0.577 | 0.577 | 0.176 | 0.000 | 0.022 | 0.177 | 0.019 | 1.000 | -0.024 | -0.052 | -0.052 | -0.061 | 0.046 | 0.049 | 0.010 |
| monthlyHoaFee | -0.255 | -0.057 | -0.273 | 0.355 | 0.355 | 0.022 | 0.602 | 0.481 | -0.034 | -0.210 | -0.024 | 1.000 | -0.255 | -0.012 | -0.182 | -0.057 | 0.153 | -0.003 |
| price | 1.000 | 0.711 | 0.662 | 0.617 | 0.617 | -0.029 | 0.707 | 0.550 | -0.287 | 0.880 | -0.052 | -0.255 | 1.000 | 0.004 | 0.839 | -0.023 | 0.071 | -0.017 |
| propertyTaxRate | 0.004 | -0.003 | 0.035 | 0.540 | 0.540 | -0.292 | 0.000 | 0.030 | 0.052 | 0.014 | -0.052 | -0.012 | 0.004 | 1.000 | 0.009 | 0.032 | -0.013 | 0.052 |
| rentZestimate | 0.839 | 0.762 | 0.670 | 0.527 | 0.527 | -0.048 | 0.550 | 0.494 | -0.299 | 0.822 | -0.061 | -0.182 | 0.839 | 0.009 | 1.000 | -0.015 | 0.127 | -0.006 |
| timeOnZillow | -0.023 | -0.054 | 0.000 | 0.085 | 0.085 | -0.094 | 0.028 | 0.053 | 0.045 | 0.008 | 0.046 | -0.057 | -0.023 | 0.032 | -0.015 | 1.000 | -0.036 | 0.019 |
| yearBuilt | 0.071 | 0.227 | -0.037 | 0.376 | 0.376 | 0.065 | 0.343 | 0.263 | -0.345 | 0.015 | 0.049 | 0.153 | 0.071 | -0.013 | 0.127 | -0.036 | 1.000 | -0.005 |
| zipcode | -0.017 | 0.024 | 0.025 | 0.553 | 0.553 | -0.177 | 0.000 | 0.000 | -0.046 | -0.019 | 0.010 | -0.003 | -0.017 | 0.052 | -0.006 | 0.019 | -0.005 | 1.000 |
Missing values
Sample
| longitude | countyFIPS | monthlyHoaFee | annualHomeownersInsurance | yearBuilt | latitude | rentZestimate | timeOnZillow | livingArea | zipcode | propertyTaxRate | bathrooms | bedrooms | price | homeType_CONDO | homeType_SINGLE_FAMILY | cluster | cluster_average_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -149.90807 | 2020.0 | 0.0 | 2840 | 1959.0 | 61.217308 | 3142.0 | 3609.0 | 2668.0 | 99501 | 1.31 | 2.0 | 3.0 | 676100 | 0 | 1 | 0 | 624584.137931 |
| 1 | -149.90822 | 2020.0 | 0.0 | 2934 | 1961.0 | 61.217136 | 3113.0 | 4334.0 | 3179.0 | 99501 | 1.31 | 2.0 | 3.0 | 698600 | 0 | 1 | 0 | 624584.137931 |
| 2 | -149.90833 | 2020.0 | 0.0 | 4187 | 1983.0 | 61.217000 | 4282.0 | 3758.0 | 3059.0 | 99501 | 1.31 | 3.0 | 4.0 | 996800 | 0 | 1 | 0 | 624584.137931 |
| 3 | -149.90834 | 2020.0 | 0.0 | 2920 | 1947.0 | 61.216720 | 3458.0 | 3543.0 | 1642.0 | 99501 | 1.31 | 2.0 | 5.0 | 695300 | 0 | 1 | 0 | 624584.137931 |
| 4 | -149.90749 | 2020.0 | 0.0 | 4100 | 2000.0 | 61.217120 | 4161.0 | 3953.0 | 4483.0 | 99501 | 1.31 | 4.0 | 4.0 | 976100 | 1 | 0 | 0 | 624584.137931 |
| 5 | -149.90723 | 2020.0 | 0.0 | 2535 | 2018.0 | 61.217003 | 3943.0 | 3011.0 | 2560.0 | 99501 | 1.31 | 3.5 | 3.0 | 603600 | 1 | 0 | 0 | 624584.137931 |
| 6 | -149.90723 | 2020.0 | 0.0 | 3042 | 1961.0 | 61.217140 | 3318.0 | 1512.0 | 3224.0 | 99501 | 1.31 | 3.0 | 6.0 | 724400 | 0 | 0 | 0 | 624584.137931 |
| 7 | -149.90546 | 2020.0 | 0.0 | 1865 | 1978.0 | 61.218330 | 3591.0 | 1201.0 | 2087.0 | 99501 | 1.31 | 3.0 | 2.0 | 444100 | 1 | 0 | 1 | 200110.153558 |
| 8 | -149.91057 | 2020.0 | 0.0 | 862 | 1973.0 | 61.214520 | 1945.0 | 5089.0 | 899.0 | 99501 | 1.31 | 1.0 | 2.0 | 205200 | 1 | 0 | 1 | 200110.153558 |
| 9 | -149.91037 | 2020.0 | 0.0 | 1944 | 1930.0 | 61.215305 | 2128.0 | 3672.0 | 678.0 | 99501 | 1.31 | 1.0 | 1.0 | 462800 | 0 | 1 | 3 | 377817.897679 |
| longitude | countyFIPS | monthlyHoaFee | annualHomeownersInsurance | yearBuilt | latitude | rentZestimate | timeOnZillow | livingArea | zipcode | propertyTaxRate | bathrooms | bedrooms | price | homeType_CONDO | homeType_SINGLE_FAMILY | cluster | cluster_average_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4905 | -149.77844 | 2020.0 | 28.0 | 2314 | 1974.0 | 61.114754 | 3658.0 | 4243.0 | 2688.0 | 99516 | 1.31 | 2.0 | 4.0 | 550900 | 0 | 1 | 0 | 624584.137931 |
| 4906 | -149.78017 | 2020.0 | 28.0 | 1933 | 1974.0 | 61.114730 | 3427.0 | 3875.0 | 1872.0 | 99516 | 1.31 | 2.0 | 3.0 | 460300 | 0 | 1 | 3 | 377817.897679 |
| 4907 | -149.78795 | 2020.0 | 28.0 | 1835 | 1974.0 | 61.115578 | 3172.0 | 4057.0 | 1789.0 | 99516 | 1.31 | 2.0 | 3.0 | 436800 | 0 | 1 | 3 | 377817.897679 |
| 4908 | -149.78871 | 2020.0 | 28.0 | 1803 | 1974.0 | 61.114285 | 2845.0 | 3903.0 | 1496.0 | 99516 | 1.31 | 2.5 | 3.0 | 429400 | 0 | 1 | 3 | 377817.897679 |
| 4909 | -149.78606 | 2020.0 | 28.0 | 1916 | 1974.0 | 61.114563 | 3226.0 | 4164.0 | 1838.0 | 99516 | 1.31 | 2.0 | 3.0 | 456300 | 0 | 1 | 3 | 377817.897679 |
| 4910 | -149.78406 | 2020.0 | 28.0 | 2764 | 1978.0 | 61.115010 | 3903.0 | 1838.0 | 4263.0 | 99516 | 1.31 | 2.5 | 3.0 | 658100 | 0 | 1 | 0 | 624584.137931 |
| 4911 | -149.78296 | 2020.0 | 28.0 | 2160 | 1974.0 | 61.115078 | 3562.0 | 3925.0 | 2200.0 | 99516 | 1.31 | 2.0 | 4.0 | 514200 | 0 | 1 | 0 | 624584.137931 |
| 4912 | -149.75220 | 2020.0 | 45.0 | 2768 | 1972.0 | 61.124302 | 4917.0 | 3260.0 | 4180.0 | 99507 | 1.31 | 4.5 | 5.0 | 659000 | 0 | 1 | 0 | 624584.137931 |
| 4913 | -149.75658 | 2020.0 | 45.0 | 2979 | 1972.0 | 61.124546 | 4130.0 | 3728.0 | 3928.0 | 99507 | 1.31 | 3.0 | 5.0 | 709400 | 0 | 1 | 0 | 624584.137931 |
| 4914 | -149.75730 | 2020.0 | 45.0 | 2410 | 1974.0 | 61.124630 | 3566.0 | 3735.0 | 2576.0 | 99507 | 1.31 | 2.5 | 4.0 | 573900 | 0 | 1 | 0 | 624584.137931 |